GeoPCA: a new tool for multivariate analysis of dihedral angles based on principal component geodesics

نویسندگان

  • Karen Sargsyan
  • Jon Wright
  • Carmay Lim
چکیده

The GeoPCA package is the first tool developed for multivariate analysis of dihedral angles based on principal component geodesics. Principal component geodesic analysis provides a natural generalization of principal component analysis for data distributed in non-Euclidean space, as in the case of angular data. GeoPCA presents projection of angular data on a sphere composed of the first two principal component geodesics, allowing clustering based on dihedral angles as opposed to Cartesian coordinates. It also provides a measure of the similarity between input structures based on only dihedral angles, in analogy to the root-mean-square deviation of atoms based on Cartesian coordinates. The principal component geodesic approach is shown herein to reproduce clusters of nucleotides observed in an η-θ plot. GeoPCA can be accessed via http://pca.limlab.ibms.sinica.edu.tw.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An application of principal component analysis and logistic regression to facilitate production scheduling decision support system: an automotive industry case

Production planning and control (PPC) systems have to deal with rising complexity and dynamics. The complexity of planning tasks is due to some existing multiple variables and dynamic factors derived from uncertainties surrounding the PPC. Although literatures on exact scheduling algorithms, simulation approaches, and heuristic methods are extensive in production planning, they seem to be ineff...

متن کامل

Modeling of Banks ‌Bankruptcy in Iran (Multivariate Statistical Analysis)

In this paper we construct a modeling for detection of banks which are experiencing serious problems. Sample and variable set of the study contains 30 banks of Iran during 2006-2014 and their financial ratios. Well known multivariate statistical technique (principal component analysis) was used to explore the basic financial characteristics of the banks, and discriminant Logit and Probit models ...

متن کامل

Principal component geodesics for planar shape spaces

In this paper a numerical method to compute principal component geodesics for Kendall’s planar shape spaces which are essentially complex projective spaces is presented. Underlying is the notion of principal component analysis based on geodesics for non-Euclidean manifolds as proposed in an earlier paper by Huckemann and Ziezold (2006). Currently, principal component analysis for shape spaces i...

متن کامل

Clustering of Multivariate Time-Series Data

A new methodology for clustering multivariate time-series data is proposed. The methodology is based on calculation of the degree of similarity between multivariate time-series datasets using two similarity factors. One similarity factor is based on principal component analysis and the angles between the principal component subspaces while the other is based on the Mahalanobis distance between ...

متن کامل

Application of Genetic Algorithms for Pixel Selection in MIA-QSAR Studies on Anti-HIV HEPT Analogues for New Design Derivatives

Quantitative structure-activity relationship (QSAR) analysis has been carried out with a series of 107 anti-HIV HEPT compounds with antiviral activity, which was performed by chemometrics methods. Bi-dimensional images were used to calculate some pixels and multivariate image analysis was applied to QSAR modelling of the anti-HIV potential of HEPT analogues by means of multivariate calibration,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 40  شماره 

صفحات  -

تاریخ انتشار 2012